A Query Classification System based on Snippet Similarity for a One-Click Search

نویسندگان

  • Tatsuya Tojima
  • Takashi Yukawa
  • Alan H Cheetham
  • Dan Ionita
  • Niek Tax
  • Makoto P Kato
  • Matthew Ekstrand-Abueg
  • Virgil Pavlu
  • Tetsuya Sakai
  • Takehiro Yamamoto
  • Mayu Iwata
  • Meng Zhao
  • Kosetsu Tsukuda
  • Yoshiyuki Shoji
  • Hiroaki Ohshima
  • Taku Kudo
  • Kaoru Yamamoto
  • Tomohiro Manabe
  • Kazutoshi Umemoto
  • Soungwoong Yoon
  • Hajime Morita
  • Takuya Makino
  • Hiroya Takamura
  • Kazuya Narita
  • Zhicheng Dou
  • Naoki Orii
  • Young-In Song
  • Daniel E. Rose
چکیده

This paper proposes a query classification system for a one-click search system that uses feature vectors based on snippet similarity. The proposed system targets the NTCIR-10 1CLICK-2 query classification subtask and classifies queries in Japanese and English into eight predefined classes by using support vector machines (SVMs). In the NTCIR-9 and NTCIR-10 tasks, most participants used complex features or rules that depend strongly on language characteristics. The authors propose a new method that uses feature vectors created by using snippet similarities instead of the above mentioned features. In the proposed method, feature vectors have fewer dimensions, provide better generalization, lower language dependency, and reduced computer resources. This method achieved accuracies of 0. 93 for a Japanese task and 0. 91 for an English task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discovering Popular Clicks\' Pattern of Teen Users for Query Recommendation

Search engines are still the most important gates for information search in internet. In this regard, providing the best response in the shortest time possible to the user's request is still desired. Normally, search engines are designed for adults and few policies have been employed considering teen users. Teen users are more biased in clicking the results list than are adult users. This leads...

متن کامل

Query Classification System Based on Snippet Summary Similarities for NTCIR-10 1CLICK-2 Task

A query classification system for NTCIR-10 1CLICK-2 is described in this paper. The system classifies queries in Japanese and English into eight predefined classes by using support vector machines (SVMs) for classification. Feature vectors are created based on snippet similarities instead of snippet word frequency. These vectors, which have fewer dimensions than those made from raw words, redu...

متن کامل

An Ensemble Click Model for Web Document Ranking

Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...

متن کامل

How do users describe their information need: Query recommendation based on snippet click model

Query recommendation helps users to describe their information needs more clearly so that search engines can return appropriate answers and meet their needs. State-of-the-art researches prove that the use of users’ behavior information helps to improve query recommendation performance. Instead of finding the most similar terms previous users queried, we focus on how to detect users’ actual info...

متن کامل

A New Hybrid Method for Web Pages Ranking in Search Engines

There are many algorithms for optimizing the search engine results, ranking takes place according to one or more parameters such as; Backward Links, Forward Links, Content, click through rate and etc. The quality and performance of these algorithms depend on the listed parameters. The ranking is one of the most important components of the search engine that represents the degree of the vitality...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013